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Abstract: This paper introduces a new framework to study the asymptotical behavior 
of the empirical distribution function of Gaussian vector components, whose correlation 
matrix r'™' is dimension-dependent. Hence, by contrast with the existing literature, 
the vector is not assumed to be stationary. Rather, the covariance matrix T*'") should 
be close enough to the identity matrix as m grows to infinity. Markedly, under this 
assumption, the convergence result depends on r'"*^ only through the sequence 7™ = 
r^™' . This result recovers some of the previous results for stationary long- 
range dependencies while it also applies to various non-stationary cases, for which the 
most correlated variables are not necessarily next to each other. Finally, we present an 
application of this work to the multiple testing problem, which was the initial statistical 
motivation for developing such a methodology. 
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1. Introduction 

1.1. Presentation of the main result 

Let us consider {Y^'^\m > 1} a triangular array for which each y^*") = {y[™'\ . . . ,1^™^) is 
a m-dimensional Gaussian vector, defined on ($7^1 -^mj IPm); with zero mean and covariance 
matrix r^'"^ For the sake of simplicity, assume that each y.^™^ is of variance 1, that is, 
r[™) = 1 for all i. Denote $(z) = P(Z > z), z G M, Z ~ AA(0, 1), the upper tail distribution 
function of a standard Gaussian variable, and consider the empirical cumulative distribution 
function: 

m 

F™(t)=m-iJ]l{$(y/"^))<t}, tG[0,l]. (1) 
1=1 

In this paper, the motivation for studying the asymptotic properties of (1) in a possibly non- 
stationary setting is the multiple testing problem, which pertains to the recent and flourishing 
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field of statistics for higli-dimensional data. In such data, while the dimension m can be very 
large (typically, several thousands), the matrix T^™'^ is generally complex and not-necessarily 
locally structured (e.g., spiked correlation matrices, see Johnstone, 2001; Friguet et al., 2009). 
To study (1), let us introduce the following quantities: 

7^ = m-2^r(-); (2) 
r-m = i'm'^ + |7m|) • (3) 



In a nutshell, our main result is as follows: by assuming, when m — ?• oo, 

~^ ^' (vanish-secondorder) 

4+eo / , .X 4 

-f^Y^i^il) for some eo>0; (Hi) 

iTT'lm — ^ ^1 for some 9 E [—1, +oo]; (-^^2) 

the following weak convergence holds (in the Skorokhod topology): 

rmi^m -/)--> Z, as m 00, (4) 

where I(t) = t and Z is some continuous Gaussian process on [0, 1] with a distribution only 
function of 9. Therefore, under our assumptions, the asymptotic behavior of solely depends 
on the sequence (7m)m- In particular, the convergence rate is of order m^/^ when mjm is 
bounded (i.e., 6 < +00) and is of smaller order otherwise. 



1.2. Organization of the paper 

In Section 2, we derive an expansion of the covariance function of F^, by using the Hermite 
polynomials and we show that (vanish-secondorder) ensures that this expansion "stops at or- 
der 1 asymptotically" . The main theorem is precisely stated in Section 3 together with many 
illustrative examples, showing that our assumptions cover many standard models of correla- 
tion matrices (e.g., equi-correlated, long-range, spiked correlations, sample correlations). This 
new methodology is then applied to the multiple testing problem in Section 4. The proof of 
the main result is presented in Section 5; it mainly relies on central limit theorems for mar- 
tingale arrays and on a suitable tightness criterion. To make the proof as clear as possible, 
some technical and auxiliary results are deferred to appendices. 



1.3. Relation to existing literature 



A colossal number of work aimed at extending Donsker's theorem (Doob, 1949; Donsker, 
1952; Dudley, 1966) to a more relaxed setup. Among them, a particularly prospering research 
field deals with the introduction of weak dependence between the original variables, mainly 
by using mixing conditions. Here, we do not attempt to provide an exhaustive list for such 
results and we refer the reader to, e.g., Dedecker and Prieur (2007); Doukhan et al. (2010) 
for a detailed review. 
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When restricted to the Gaussian subordinated setting, asymptotics for the e.c.d.f. (1) are 
described in the two well-known papers of Dehling and Taqqu (1989) (long-range) and Csorgo 
and Mielniczuk (1996) (short-range). However, both studies make a stationarity assumption, 
that is, assume that the covariance matrix is of the form T^™^ = r(|i — j|), 1 < i,j < m, 
for some function r(-) vanishing at infinity and not depending on m. Here, the setting is 
markedly different: first, the covariance matrix T^"^^ is allowed to dependent on m, that is, 
the y^'^^'s form a triangular array of Gaussian variables. Second, r^™) need not be locally 
structured, that is, r^™^ is not necessarily related to the distance between i and j. Instead, our 
conditions are permutation invariant, that is, are unchanged when permuting the columns of 
the triangular array. This is quite natural because the e.c.d.f. is itself permutation invariant. 
Third, the correlations considered in our framework are possibly negative (although in a 
reasonable amount to keep the overall covariance matrix nonnegative) and our approach 
shows that the negative correlations can decrease the asymptotic covariance or even increase 
the convergence rate. 

Compared to the existing literature, our framework covers a part of the previous results 
(e.g., includes long-range of Dehling and Taqqu (1989) but excludes short range of Csorgo 
and Mielniczuk (1996), see Section 3.2) while it opens a window for other dependence models, 
as, for instance, the spiked correlation model introduced by .Johnstone (2001). The latter is 
of primary interest for modeling microarrays data, for which few external variables (factors) 
are able to have a small influences on all the measured l^'s, see Friguet et al. (2009) and Fan 
et al. (2010). Typically, a "benchmark" sub- model for our approach is the equi-correlated case 
where vf^^ = pm, i ^ j, for some correlation tending to zero (at some arbitrary rate). Let 
us note that we have studied this special case in an earlier work, see Delattre and Roquain 
(2011). 

Finally, let us mention that recently, Bardct and Surgailis (2011) have also removed the 
stationarity assumption, by establishing central limit theorems (CLT) for Gaussian subordi- 
nated arrays. There are two major differences with our work: first, they deal with a CLT for 
the partial-sum process and not with a functional CLT for the e.c.d.f. Second, their assump- 
tions are not of the same nature, because they require that jF-'^^l < r(|i — j\) for all i, j, for 
some function r(-) vanishing at infinity. 

2. Preliminaries: first order approximation for the covariance of 

Throughout the paper, to alleviate the notation, we win often denote P„^ by P, by Y 

and r'-™'^ by F when not ambiguous. 

Let us consider the sequence of Hermite polynomials Hi{x), ^ > 0, x € M (see Appendix B). 
By using Melher's formula, the covariance function of the process Fm(-) can be described as 
a function of the correlation matrix F of Y . 

Proposition 2.1. Consider¥m{) the process defined by (1) and the function family {ci{) ,£ > 
1} defined by 

ce{t) = H,_^{<^-\t))cl){^~Ht)), [0,1], ^ = 1,2,..., (5) 
where cj) = is the standard Gaussian density. Then for all t,s G [0, 1], we have 

Cov(F,„(i),F„(.)) =Y^-Jm^ (m-^Y.(^,A ■ (6) 
e>i ■ \ i,j ) 
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This result can be found, e.g., in Proposition 1 of Schwartzman (2010) (itself referred to 
Theorem 1 of Efron (2010)). We provide a proof in Appendix B for completeness. While (6) 
is an exact expression, we can try to approximate the covariance Cov{¥m{t),¥m{s)) when m 
grows to infinity, while making some assumption on the matrix T = T^"^h 

Firstly, let us note the following: since m" 
(by using (72) in Appendix B), expression (6) shows that the following conditions are 
equivalent as m tends to infinity, 

VtG [0,1], Var(F„(t)) = o(l) (7) 

V^> 1, m-2^(r,,,)^ = o(l) (8) 

2 



m 



= oil) (A) 



As a consequence, condition (A) is required as soon as a convergence result of the form (4) 
holds. Note that the rate defined by (3) satisfies 1 < (m~'^ Ylii )(rj,j)^) ^ r„i < \/m. 



Hence r.^ tends to infinity under (A) but not faster than \Jrn. 
Secondly, let us rewrite (6) as follows: 

Cov(F„(t),F„(s)) = m-i(t Ns-ts)+ 7mCi(t)ci(s) 

+ E c,{tyi{s){t'r\ (9) 

where 7m is defined by (2). The latter holds because, for two independent 7\^(0, 1) variables U 
and F, we have mT^ Yl,i>\ cKOcKs)(^0"^ = Cov(l {4>(C/) < t}, 1 {^{V) < s}). In expansion 
(9), the second order term (i.e., the sum over £ > 2) is negligible w.r.t. the other terms if 
(vanish-secondorder) holds. Hence, assuming now (vanish-secondorder) , we obtain that the 
rescaled covariance Cov{rm¥m{t),rm¥rn{s)) of rm^m converges to the following covariance 
function 

^) = TTW\^^ A s - ts) + ^-^ci(t)ci(s), (10) 

where 6 is defined in {H2) and where we use the conventions 0/(1 + |0|) = 1 and 1/(1 + |0|) = 
when 6 = +00. Note that (^^2) always holds up to consider a subsequence, because rwym > — 1 
from the nonnegativeness of T^"^\ 

Remark 2.2. One should note that, in the RHS of expression (10), the second term is not 
necessarily a covariance function because 9 can he negative. Nevertheless, K can he written 

k{t,s) =t ^s -ts - ci{t)ci{s) (11) 



as K{t,s) = j:KgTK{t,s) + Y^^ci{t)ci{s) , where 



turns out to he a covariance function; considering a Wiener process {Wt)te[o,i], K is the covari- 
ance function of the process Wt — tWi — ci{t) ^~^{s)dWs, which is the orthogonal projection 
in LP' of Wt onto the orthogonal of the linear space spanned hy Wi and ^~^{s)dWs. In- 
terestingly, the latter also shows that the original covariance K given hy (10) can he seen 
as the covariance function of Zt = (1 + \e\)-'^/'^ {Wt - tWi) + (1 + \e\Y'''\{l + (9)^/^ - 
l)ci{t)ji^-\s)dWs. 
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3.1. Statement 

Our main result establishes that the convergence of the covariance functions investigated in 
Section 2 can be extended to the case of a weak convergence of process. For this, we should 
additionally assume (Hi) (which is related to the tightness of the process). 

Theorem 3.1. Let us consider the empirical distribution function ¥m defined by (1). Assume 
that the covariance matrix F^™) depends on m in such a way that (vanish-sccondordcr) and 
{Hi) hold with r„i defined by (3) and assume {H2). Consider {'^t)t<^[o,i] 0, continuous process 
with covariance function K defined by (10). Then we have the convergence (in the Skorokhod 
topology) 

r-mi^m — /) Z, as m ^ 00, (12) 

where I{t) = t denotes the identity function. 

A striking point is that (12) depends on F only through the sequence 7^. In regards to the 
limiting covariance function (10), there are two possible regimes: 

(i) if < +00, the process m^/^(Fm — I) converge to a (continuous Gaussian) 
process with covariance function given by (i, s) ^tAs — ts-\-6 {t))4>{^~^ (s)) . 
Hence, the limit process is a standard Brownian bridge when ^ = 0, but has a covariance 
function smaller (resp. larger) if < (resp. 6 > 0). 

(ii) if — )• = +00, the "Brownian" part asymptotically disappears and (7™) ""^^^(I^m — 
/) converge to the process 4>{^~^{-))Z for Z ~ M{0, 1). 

The above behaviors are illustrated in Figure 1: as rwym grows, the "Brownian" part dis- 
appears and the process looks like function (randomly rescaled). Also, the y-axis 
indicates that the scaling ^/m is not suitable for large values of rwym- 

Remark 3.2. Assumptions (vanish-secondordcr) and {Hi) always hold under the following 
condition 

< CLm for all i j and Om satisfies m^^^a^ — )■ for some (5 > 0. (13) 

Remark 3.3. In Theorem 3.1, the result holds by replacing the set of assumptions (vanish-secondorder) , 
{Hi) and {H2) by the two following one: 

with So > 0; {H3) 

with £0 > 0. {H4) 

This is proved in Section 5.5. However, note that the latter can only be useful in regime (ii) 
since {H.i) implies mj^n +00. 



m 



2 



lib 



0(1) 



+00, 
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m')m = 10 



m7„ = 10^ 





0.0 0.2 0.4 0.6 0.8 1.0 



0.0 0.2 0.4 0.6 0.8 1.0 



Fig 1. Plot oft i-s- m^''^(F„(f) - t) for some observed Y{uj). These realizations have been generated in the 
equi- correlated model (14) and for m = 10^. 



3.2. Examples 

Equi-correlation Let us start with the following simple example: 



p(m) 



/ 1 



V Pn 



Pm 



Pra 



\ 



Pn 



Pra 
1 J 



^1 Pro) Ira ~\~ Pn 



/1\ /1\ 



VI/ VI/ 



(14) 



where pm G — 1] is some parameter. We easily check that 7m, defined by (2) is given 
by = {m — l)pm and that the assumptions of Theorem 3.1 are all satisfied if pm and 
mpra converges to some 6 G [— l,+oo], which yields convergence (12). This is in accordance 
with Lemma 3.3 of Delattre and Roquain (2011). 

This simple example already shows that, following the choice of the sequence {pm)m, the 
empirical distribution function can have various asymptotic behaviors. For instance, taking 
Pm = — (m — 1)~^ gives a process in regime (i) with a minimal asymptotic covariance function 
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{9 = —1, see (11)), while taking pm ^ m leads to a rate ~ m^/^ ^ m^/^ and thus a 
process converging in regime (ii). 



Alternate equi-correlation Let us consider the covariance matrix: 

/ 1 -pm Pm ■ ■ ■ \ 
Pm 1 Pm 
Pm • • • Pm 



p(m) 



V 



Pm 
Pm 



1 

-Pm 



-Pn 
1 



/ 1 \ 



(1 - Pm)Im + Pn 



\ 1 / 



/ 1 \ 



V 1 / 



(15) 



where pm £ [—{m — 1) , 1] is a given parameter. Clearly, 7^ is such that 

m—l m—i 

mjm = 2pmm-^ ^(-'^f = pm[m/2\/{m/2) 



8=1 k=l 



Hence the rate defined by (3) is r.^ ~ y/m and assumptions of Theorem 3.1 are fulfilled 
(with 9 = 0) hy assuming that m^+'^p^ — )• 0, with 6 > (because (13) holds). Hence, under 
that assumption, ^/m{¥m — I) converges to a standard Brownian bridge. 

Maybe surprisingly, this example shows that, even if the correlations are "strong" (e.g., 
Pm ~ to be compared with the equi-correlated case), positive and negative correla- 

tions can exactly compensate each other to provide the same convergence result as under 
independence. 

Long-range stationary correlations Let us consider the correlation matrix of the fol- 
lowing form: 



-,{m) 



r[\J 



for r(0) = 1, r{k) = k-^L{k), 0<D <1, 



(16) 



where L : (0, +00) — )■ (0, +00) is slowly varying at infinity (Vt > 0, L{tx) ~ L{x) as x — )• +cxd). 
This framework is often referred to as "long-range dependence" in literature dealing with a 
stationary setup (see, e.g., Dchling and Taqqu (1989); Doukhan et al. (2002)). First, standard 
calculations easily show that for all > 0, 



m—l 



m 



2m-i^ J^F 



~ < 



i=l fe=l 

Thus, for any z^i € {D, 1), since L is slowly varying, 



21ogm 



if 1/ G [0, 1) 
if 1/ = 1 



(17) 



by applying (17), where the "um, ^ ^^m" means Um = 0{vm)- This entails ^7^''^ '^i)/(2!^i) > 



m 



(1 !^i)/2 Qj^^ thus Assumption {H4) holds. In particular. 



-1/2 



. Additionally, for any 



V2 G {D, 1) and 1^3 £ (0, 2D) such that ^3/1^2 > 1, by applying again (17), 



Im m 



2 ^ utS -2 



'Y\j- ^l"''' ^ m'^25-!.3 V {rrf^^-^ logm) 
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for any 6 > 0. We derive (H^) because taking 5 > 1 such that 6 < 1^3/1^2 and 5 < 1/^2 is 
possible. By using Theorem 3.1 under the conditions of Remark 3.3, we derive 



7^^/^(Fm - /) ci(-)Z, as m 



00, 



for Z ~ A^(0, 1). This is in accordance with Theorem 1.1 of Dehling and Taqqu (1989) (see 
in particular Example 1 therein). 

Finally, let us note that Assumption (vanish-sccondordcr) of Theorem 3.1 is not satisfied 
for a covariance matrix of the type (16) taken with D > 1 (short-range) (the other terms in 
the covariance expansion (9) are required in the limit, see Csorgo and Mielniczuk (1996)). 

Weak short/long range correlations Let us modify slightly the matrix (16), by letting: 



^i]^ = Pmr{\j - for r(0) = 1, r{k) 



-D 



D>0, 



(18) 



where pm is some nonnegative parameter (we removed the slowly varying function for the sake 
of simplicity). When pm varies in function of m, note that the latter is not of the stationary 
type. From (17), we have 



mjrn ~ < 



2p 



'»(l-D)(2-D) 

2pm log m 



k>l 



k 



~D 



if D G [0, 1) 
if D = 1 
if D > 1 



(19) 



Assuming that the quantity (19) as a limit (denoted 9) and that pm — )• as m grows to infinity, 
(vanish-secondorder) and {Hi) hold if m}^^ p^ — )• with 5 > (because (13) holds). The 
resulting rate of convergence is given as a function of D and pm in Table 1 . Markedly, weak 
short-range correlations {D > 1) always yields ~ m}/"^ while weak long-range correlations 

Z)/2+l/3 £qj. 



~ m 



{D < 1) can give both regimes. For instance, taking p^ ~ m "^^^ yields r, 
D < 1/3 and r-m ~ m^/^ otherwise. Overall, the convergence rate increases with D. 





D G [0, 1) 


D > 1 




p^m^-" = 0(1) 




e < 00 




rm ~ 








e^oo 


-1/2 D/2 

rm ^ pm rn 


not possible 



Table 1 

Rate rm defined by (3) in function of D > and pm such that pm = o{m^''^^^^^^) for some 5 > Q, for the 

particular covariance (18). 



Spiked correlation matrix model The "spiked" covariance model has been introduced in 
Johnstone (2001). It assumes that the fc-first eigenvalues of the covariance matrix are greater 
than 1 (for some fixed value of k) while the other are all equal to 1. In our setting where we 
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consider only correlation matrices, we assume that the sequence of eigenvalues is constant 
after some fixed rank k. Precisely, let us consider a matrix F^™") of the following form: 

= (1 - p^)I^ + PmPHP^, (20) 

where H is a k x k diagonal matrix with diagonal entries h^^\...,h^^^ E (l,oo), where 
P = (j'i'^^)i<i<m,i<r<fc is an m X k matrix such that P^P = Ik and where pm S [—1, 1] is 
some parameter. Importantly, k is taken fixed with m. The k first eigenvalues of r^™") are 
thus given by 1 — /9m + Pmhi^\ r = 1, . . . ,k, while the remaining eigenvalues are all equal 
to 1 — Pm- Hence, to ensure that T^"^^ given by (20) is a well defined correlation matrix, we 
should additionally assume that for all r = 1, . . . , fc, 1 — pm + Pmh^^ > 0, and that PHP^ 
has diagonal entries equal to 1, that is, for alH = 1, . . . , m, X]r=i hi"^\p\"^'')'^ = 1. Note that 

the latter requires h^^^ + • • • + h^f^^ = m and thus maxj-j/i-r™'''} > m/k. 

Next, by using (20), the conditions above and some properties of the Frobenius norm, we 
can derive the following: 



= Pmy: h^r' m-V^ > - p^, (21) 




2^ [^xn =Pm\y XK^'i^Y -i/m\. (22) 

Since the RHS of (22) is upper-bounded by p'^{k—m~^) and lower-bounded by pf^{k~'^ — m~^) 
and since k is taken fixed with m, condition (A) is satisfied if and only if p^, — ?■ while 
(vanish-secondorder) holds if and only if r^Pm — ^ 0. Additionally, we have 



1/m 




Pm(l - IM) 



where we used the Cauchy-Schwartz inequality (we dropped the dependence in m in the 
notation for short). Finally, the assumption of Theorem 3.1 are all fulfilled provided that 



r, 



m^Vm ^ with (5 > (23) 



(up to consider a subsequence making the quantity into (21) converges to some 9). In (23), 
the rate can be computed by using the definition, see (3), or expression (21). The rate 
of convergence thus intrinsically depends on the asymptotic behavior of the coordinate-wise 
mean of each eigenvector (p-™'')i<j<m • 

To further illustrate this example, we can focus on the particular case where /c = 1. In that 
case, the model can be equivalently written as 

F^'-) = (1 - pm)lm + PmCf, (24) 
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where ^ = ^^'^^ is a m x 1 vector in {—1, 1}™ and where pm £ [— (m — 1)^"*^, 1]. The model 
(24) contains as particular instances the equicorrelated matrix (^(™) = (1 1 • • • 1)-^) and the 
alternate equicorrelated matrix (,f = (1 — 1 1 • • • )^) that we have studied above. We easily 
check that condition (23) recovers the conditions that we obtained in each of theses particular 
cases. In general, for an arbitrary G l}*") since the quantity in (21) is equal to 

^-1/2 ^^Mj (25) 

the rate is directly related to the number of —1 and +1 into For instance, if = 
{Ui, . . . , Um) where Ui, U2, ■ ■ ■ are i.i.d. random signs, we have by the central limit theorem 
that the quantity (25) tends to (in probability) whenever pm — )• 0, which gives a rate 
fm ~ (in probability). Hence, we obtain the convergence (12) with the same rate and 
asymptotic variance as in the independent case whenever m^'^^p'^ — )• with 6 > 0. 

Sample correlation matrix We consider the model where the correlation matrix is gen- 
erated a priori as a Gaussian empirical correlation matrix. Namely, let us assume that 

^ D-^SD-^, for S = n-^X'^X and D = diag(5i,i, • • • , Sm,n,Y''^ (26) 

where X is a rim x matrix with i.i.d. AA(0, 1) entries. Assume m/rim — t- as m tends to 
infinity, which, in a statistical setup, corresponds to assume that the number m of variables 
(columns of X) is of smaller order than the sample size rim- 

A by-product of Theorem 2 in Bai and Yin (1993) (adding a number of variables which is 
a vanishing small proportion of the sample size) is that, 

P 

115* — Im\\2 — > 0, 

where || • II2 denotes the Euclidian-operator norm, that is, US' — /m||2 = maxi<j<m IaI"^ - 1| 
and A^"'^ . . . , A^^'' denote the eigenvalues of S. Hence, maxi<i<m \ Si^i — 1| -^0, which in turn 
implies Hr'-''"-* — Imlb ~> 0. Next, simple arguments entail the following inequalities: 

< (1 • • • 1)^, (r(™) -im){i---if >\< ||r("^) - imU 



4 



Moreover, we easily check that E(nm^5'jj)^ = E yrim^'^ Ylk=i ^k,iXkjJ is upper bounded 
by some positive constant. Hence, by assuming that the sequence satisfies 

m}~^^ /um — >■ for some (5 > 0, 



the above inequalities implies that the rate is ~ \/^) that {H2) holds with ^ = and that 
(vanish-secondordcr) and {Hi) are satisfied (all these convergences holding in probability). 
Hence, Theorem 3.1 can be applied and this shows that the asymptotic of the empirical 
distribution function is the same as under independence. 
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4. Statistical application: asymptotic study of Benjamini-Hochberg's procedure 
under dependence 

4.1. Two-group model and background 

Let us add to the original vector Y ~ J\f{0,T) an unknown vector H = {Hi)Ki<rn G {0, 1}"* 
as follows: for 1 < i < m, 

X, = iiHi + Yi, (27) 

for some positive number ^ (assumed to be fixed with m). Hence X ~ J\f{p,H,T). Now 
consider the statistical problem of finding H from the observation oi X = (Xj)i<j<m. From 
an intuitive point of view, H is the "signal" (unknown parameter of interest), Y is the "noise" 
(unobserved) while T and fi are "nuisance" parameters, generally assumed to be unknown. 
Obviously, because of the stochastic variations, the signal cannot be recovered exactly, hence 
we should define an error criterion that supervises the statistical decision. 

In this context, a statistical procedure is a (measurable) function with a 0/1 output for 
each 1 < i < m, that is, declaring some of the iifj's as and the other as 1. To evaluate 
the quality of such procedure, the false discovery rate (FDR) is a criterion coming from the 
multiple testing research field that has received a major and growing attention in the two last 
decades since its introduction by Benjamini and Hochberg (1995). The FDR is the expected 
value of the false discovery proportion (FDP), itself defined as the proportion of Hi equal to 
zero among the Hi deemed to be 1 by the procedure. The procedure that came up with - 
often called the "BH procedure" — suitably controls the FDR under independence of the Xj's 
(and positive dependence, see Benjamini and Yekutieli (2001)) and is nowadays commonly 
used to find signal in massive data generated from high-throughput devices (e.g., in genomics 
or neuroimaging) . However, a challenging issue often rose in recent statistical literature is the 
precise mathematical study of the BH procedure when the observed Xj's are dependent. While 
the FDR of the BH procedure is reported to be essentially unaffected by dependencies, see 
Farcomeni (2006); Kim and van de Wiel (2008), some authors showed with simulations that 
this should not be the case for the FDP, see, e.g., Korn ct al. (2004). The present work brings 
a broad theoretical support for this, by showing that the length of the asymptotic confidence 
interval for FDP is widening as the quantity jm defined by (2) grows. 

The formal link between the FDP, the BH procedure and the e.c.d.f. has been delineated in 
Gcnovcsc and Wasscrman (2004); Farcomeni (2007) (FDP at a fixed threshold) and consoli- 
dated later in Neuvial (2008) (FDP at BH threshold). Here, we follow the approach of Ncu- 
vial (2008), by using that the FDP of BH procedure is a Hadamard differentiable function of 
(rescaled) empirical distribution functions. Convergence results are thus simply derived from 
Theorem 3.1 by applying the functional delta method. Nevertheless, let us underline that we 
have made the following new interesting finding that complements the methodology of Neu- 
vial (2008): when one wants to study the FDP of BH procedure (and not another procedure), 
only the convergence of the e.c.d.f. in the one-class model of Section 1.1 is needed (and not the 
joint convergence in the two-class model). This relies on the "partial functional delta method 
on L'(0, 1)", see Proposition C.2. 
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4.2. Link with Theorem 3.1 

Let us first define the following e.c.d.f.'s: for t S [0, 1], 



Fo,m(t) = mo 1 ^(1 - Hi)l mXi) < t}; (28) 
1=1 

m 

¥i,m{t)=m^^J2Ha{HXi)<t}; (29) 

i=l 

m 

Gm{t) = V 1 {^{Xi) <t} = — Fo,m + — Fl,m, (30) 



j=l 

where mo = Y^^ii^— Hi) and mi = J^iLi ^i- The proportions mo/m and mi/m are supposed 
to converge when m grows to infinity and the limits are denoted by ttq £ (0, 1) and tti G (0, 1), 
respectively. From Section 2, when F satisfies (A), the e.c.d.f.'s Fo,m(i)) Fi^m(i) and Gm{t) 
converge in probability and we denote in what follows the limiting c.d.f.'s by Fo(t) = t, 
Fi{t) = $(^>-i(t) - n) and G{t) = 7roFo(t) + iriFi{t), respectively. 

In a nutshell, the FDP of BH procedure (at level a G (0, 1)) corresponds to the random 
variable 



m ^o,m(7'(Gm)) fmo^ mic, \ 

FDPm = a— — ^ = ^' — Fo,m, — ¥i,m , (31) 

T (a . V m m / 



mo 

T{Gm) ^ ' m 

where we used the following functionals: 



T{H) = sup{t G [0, 1] : H{t) > t/a} for H £ D{0, 1); (32) 
^{Ho, Hi) = aHo{T{Ho + Hi))/T{Ho + Hi) for {Hq, Hi) G D{0, 1)^, (33) 

and where we used the conventions sup{0} = and 0/0 = 0. By Corollary 7.12 in Ncuvial 
(2008), T is Hadamard differentiable at function G, tangentially to the set C(0, 1) of contin- 
uous functions on (0, 1) and w.r.t. the supremum norm (we refer to Section 20.2 in van der 
Vaart (1998) for a formal definition of Hadamard differentiable functions). This holds because 
G is strictly concave and limt_^o G(t)/t = +c«, which yields in particular T{G) G (0, 1). As 
a consequence, standard calculations show that \I' is Hadamard differentiable at {■kqFq,ttiFi) 
tangentially to C(0, 1), with derivative 

^inoFo,n,Fr){Ho,Hi) =a^^'^^, for {Ho, Hi) G C7(0, 1)^. (34) 

Now, by using (31), the functional delta method provides the asymptotic behavior of FDPj^^ 
from the one of (^Fo,m, ^¥i,m)- As a matter of fact, since the derivative 'if(^pg p^){HQ,Hi) 
only depends on Hq while the limit processes are (a.s.) continuous, establishing convergence 
results separately for Fo,m and Fi^^ is sufficient (we do not need to consider the joint process 
(^Fo,m) ^Fi^m))- We have precisely formulated this argument in Proposition C.2. Hence, 
applying twice Theorem 3.1 (for Fo,m ~ Fmg and Fi^m ~ Fm^ o Fi), we are able to derive a 
convergence result for FDPm, as we establish in the next section. 



imsart-generic ver. 2007/09/18 file: DR2012-vl.tex date: October 10, 2012 



Delattre, S. and Roquam, E./Asymptotics of e.c.d.f. for Gaussian arrays 

4.3. Results 

First, let us introduce the following additional quantities: 



13 



ro,r. 



ri,r. 



mo + 



-1/2 



-1/2 



(35) 



(36) 



Corollary 4.1. Consider the two-group model (27), generated from parameters fj,, H = 
H^'^^ and a correlation matrix T = T^'^\ Assume that niQ (depending on H) is such that 
^/rn{mQ/m — ttq) — t- 0. Assume thatV satisfies (vanish-secondorder) , [Hi) and that the rates 
fm, ^o,m cLnd ri^m, respectively defined by (3), (35) and (36), grow proportionally to infinity 
as m tends to infinity. Let a E (0,1) and t* = t*{iJ,,a) be the unique t E (0,1) such that 
G{t) = t/a. Let h{t*) = {(l){^-^{t*))/t*f . Then the sequence of r.v. FDPm defined by (31) 
enjoys the following convergence: 



FDPm -TToa 



7roa{{l/t^ - l)/mo + ^(t*)7o,m}^/^ 



AA(0,1), 



(37) 



where 70,™ = " ^0(1 " Hj)Tij. 

Proof First, classically, it is sufficient to prove that the convergence (37) holds up to consider 
a subsequence. Hence, we can assume that {H2) and the convergences 



m, 



"^Y.{l-Hi)il-Hj)T,^j^eo; 



(38) 



hold, with 6, 9q and 61 valued in [—1, +00]. Also note that since cx ro,m (resp. oc ri^m), 
condition (vanish-sccondordcr) and (ILi) also holds for the submatrix {Ti j)i j.jj.^H.^Q (resp. 
{^i,j)i,j:H,=Hj=i)- Now, let us write 



(— Fo,m(t) - vroFo(t)j = ro,„^m~^ ^(1 - Hi){l {$(X,) <t]-t)+ ro,„^^(mo/m - ttq). 

1=1 

(39) 



In the RHS of (39), while the second term converges to by assumption, a consequence of 
Theorem 3.1 is that the first term converges to a process with covariance function 



-I 



1 9 

r(t As-ts) + — ^— Cl(t)ci(s) 



Obviously, a similar result holds for the process ri^m{^^i,m — t^iFi)- 



for ah t,s E [0,1]. 



imsart-generic ver. 2007/09/18 file: DR2012-vl.tex date: October 10, 2012 



Delattre, S. and Roquam, E./Asymptotics of e.c.d.f. for Gaussian arrays 



14 



Applying the (partial) functional delta method as explained in Proposition C.2 (by using 
ro,m. oc ri,m and (34)), we obtain 



ro,m(FDPm - vroa) J\f (0 , (avro) 



(40) 



Finally, we easily derive (37) by separating the cases < +oo and = +oo. □ 

As an illustration. Corollary 4.1 can be used in the independent case (7o,m = 0) or p^-equi- 
correlated case (7o,m = Pm), so recovering the previous results of Ncuvial (2008, 2009) (in the 
Gaussian case) and Delattre and Roquain (2011), respectively. This holds for any H satisfying 
^/m{mQ/m — 'KQ) — t- 0. However, for an arbitrary covariance F, the quantity 70, m unfortunately 
depends in general on the unknown H and thus cannot be computed in a practical situation. 
This fact is inherent to the multiple testing setting considered here, because the dependencies 
accounting in the FDP of BH's procedure are related to the submatrix (Tij)ij-H.=Hj=o and 
thus are directly linked to the location of the null hypotheses. 

A convenient way to circumvent this problem is to add prior random effects, by assuming 
that, previously and independently to the model (27), we have drawn H = {Hi, . . . , Hm) 
for H\,H2,... i.i.d. Bernoulli variables of parameter vri = 1 — ttq, for some ttq G (0,1). 
Thus X follows the distribution J\f{pH, F) conditionally on H. The corresponding global 
(unconditional) model, often referred to as the two-group mixture model has been widely 
used in the multiple testing literature, see, e.g. Efron ct al. (2001); Storey (2003); Gcnovcsc 
and Wasserman (2004); Roquain and Villers (2011). By contrast with the previous model, H 
is now random. In particular, mo = Yl^ii^ ~ ^i) ~ B{m, ttq) and y/m{mQ/m — ttq) does not 
degenerate at the limit, which adds some extra variance in the FDP convergence result. The 
counterpart is that the statement is substantially simplified, as we can see below. 

Corollary 4.2. Consider the two-group mixture model defined above, generated from pa- 
rameters /X > 0, vTo G (0,1) and a correlation matrix F = T^"^\ Assume that F satisfies 
(vanish-secondorder) and (Hi). Let a E (0, 1) and t* = t*{fi,a) be the unique t G (0, 1) such 
that G{t) = t/a. Let h{t*) = ((/>(^>~i(t*))/t*)2. Then the sequence of r.v. FDPm defined by 
(31) enjoys the following convergence: 

AA(0,1), (41) 



vroa {{l/t* - TTo)/{7Tom) + h{t*)j^}^/^ 
where is defined by (2). 

Proof. Again, it is sufficient to state the result up to consider a subsequence. Thus {H2) holds 
without loss of generality. First check that (vanish-secondorder) entails 



^2 




/7,)(l-F,)F,,,-7ro^^F,,, I =op(l), (42) 



(computing, e.g., the variance of the latter) and this convergence can be made a.s. by taking 
a suitable subsequence. A consequence of (42) is that 7o,m ~ 7m a.s. (in particular, de- 
fined by (38) equals vro^.) This implies oc ro,m (a.s.), which provides (vanish-secondorder) 
and {Hi) for the submatrix {Tij)ij-H.=Hj=o (a.s.). Now, by using (39), we obtain that 
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fo,m [i^^o,mit) — vroFo(t)j Converges (unconditionally) to a process with covariance func- 
tion defined by: for all t,s £ [0, 1], 



4 



:(tAS-ts)+ Cl(t)ci(g) 



i+TToi^r l+^ol^l 



^ 1/^0 + 1^1 



4 



-(t A s - vrots) + — r^ci{t)ci[s) 



l + vro|6'P l + vrol 



Obviously, a similar result holds for the process ri^mi^^i,m — ttiFi). We finish the proof by 
applying the (partial) functional delta method, see Proposition C.2. □ 

Corollary 4.2 clearly shows that the concentration of FDPm around ttqu deteriorates when 
7m increases, so when positive correlations appear between the individual statistical tests. 
However, notice that, perhaps surprisingly, negative correlations help to decrease and can 
yields to a concentration even better than under independence when 7^ is negative (although 
this phenomenon is necessary of limited amplitude because 7^ > — 1/ tu) . 

From a practical point of view, (41) still contains the unknown quantities ttq and t*. For- 

tunately, t* can be easily estimated because T(Gm) — > T{G) = t* (by the Hadamard differ- 
entiability of T at G and by using the same arguments as in the proof of Proposition C.2). 
As for vTo, this parameter seems hard to estimate in our dependent context, but we can use a 
conservative uniform upper-bound. Finally, under the assumption of Corollary 4.2, we obtain 
the following asymptotic (1 — C)-confidence upper-bound for FDPm: 

sup |7roa(l + <I>-i(C){(l/t-^o)/(^o"i) + Mt)7m}'^')|, (43) 
7roe[o,i] ^ ^ ^ 

by denoting i = T{Gni) V (a/m) and h{t) = {(t){^-^ {t)) / tf for t G (0, 1]. 
5. Proof of Theorem 3.1 

5.1. A related result and additional notation 

Let us define the "modified" empirical distribution function by the following relation: for 
t G [0,1], 

rm{^m{t) -t)= rm(J,n{t) - t) + Ci(t)r„F„. (44) 

The convergence of the two processes rm(¥rn — I) and rm(^m — I) are strongly related by 
(44) . The main idea of our proof is to deduce the convergence of {^m — I) from the one of 
fm(^m — I)- Precisely, the following result will be proved together with Theorem 3.1 in the 
sequel. 

Proposition 5.1. Under the assumptions of Theorem 3.1, let us consider the corrected empir- 
ical distribution function defined by (44) and a continuous process (Zt)t<=[o^i] with covari- 
ance function K defined by (11). Then we have the convergence (in the Skorokhod topology) 

rm(Fm-/) ^Z/(l + |0|)i/^ asm^oo, (45) 

where I{t) = t denotes the identity function. 
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Additionally, throughout the section, we use the following notation 

ht{x) = l{<^{x)<t}-t-ci{t)x, (46) 

so that Fm(t) —t = X^I^i ht{Yi). Finally, we will sometimes use the following assumption: 

there exists 77 > (independent on m) lower bounding the m eigenvalues of T^^\ 

(eigenvalues- awayO ) 

5.2. Convergence of finite dimensional laws for ¥m 

Let us prove the following result. 

Proposition 5.2. Assume that the covariance matrix T depends on m in such a way that 
(vanish-sccondorder) holds with r^a defined by (3) and assume {H'l). Consider a continuous 
process (Zt)4e[o.i] with covariance function K defined by (11). Then, the process (rm(Fm — 

I),y/'"'*) (jointly) converges to £(Z/(l+|^|)-'^/^)®AA(0, 1) in the sense of the finite dimensional 
convergence. In particular, the convergence (45) holds in the sense of the finite dimensional 
convergence. 

Proof. The proof is based on central limit theorems for martingale arrays as presented, e.g., 
in Chapter 3 of Hall and Heyde (1980). 

First, since we aim at obtaining a convergence jointly with Y^'^\ a (somewhat technical) 
but useful task is to define the array of random variables {Y^'^\l < i < m,m > 1) is 
such a way that y^^™^ is fixed with m. This is possible by first considering some variable 
Z ~ AA(0, 1), by letting y^*-™"-* = Z for all m > 1, and then by choosing for each m > 2, the 
variables Y^™'\ 2 < i < m, such that 

- (Z,y/™\2 < i < m) ~ AA(o,r('^)); 

i<m,"T' > 2} is a family of mutually independent vectors conditionally on Z. 

This also define a common underlying space (fi, P) for the array of random variables. 
Now, define the following nested array of c-field: for m > 1, Gmfl = o"(0) and for 1 < i < m, 

Qm,i = ct{yP, l<j <iA£,l<e<m). 

Next, let us consider for each t £ [0, 1], the martingale array {Mm,i{t), Gm,i, 1 < i < m,m > I) 
defined as follows: 

i 

Mm,m = J2 Xm,j{t) for (t) = ^ {htiY}""^) - E (/it(y/™^) I amj-i)) • (47) 
Clearly, 

m 

rm{¥^{t) -t) = M^,m{t) + ^ J] E (/lt(y/'"^) I ■ (48) 

i=l 

Also note that we can replace each Qm,i by Fm,i = ■ ■ ■ i^/'"'*) (-^m,o = o'(O)) in the 

above expression, because {Y^^\ 2 < i < m) is independent of (Xj^\ 2 < j < iA£,2 < i < m), 
conditionally on Y^"^\ 
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Case 1: (cigenvalues-awayO) is assumed We show in Lemma A.l expression (68) that the 
second term in the RHS of (48) has a vanishing variance as m tends to infinity. Therefore, 
it remains to show that the conclusion of Proposition 5.2 holds for the process Mm,m, which 
we prove by using Lindeberg's theorem. We use Corollary 3.1 page 58 in Hall and Hcydc 
(1980) (or more precisely its generalization to the multidimensional case). The conditions are 
as follows: 

(i) for aU t G [0, 1], for all e > 0, Y:T=i ^ ((^w(i))'l {\XmAi)\ > ^1 I A 0; 

(ii) for aU t,s G [0, 1], ET=MXmAt)Xn.As) \ -F„,,_i) A K{t,s). 

To check (i), let us fix t G [0, 1] and prove YliLi IE(Xm,j(t))'^ = o(l). By definition, we have 

i=l i=l 

(4m 44m .\ 

1=1 i=l ) 

4^ m . 

i=l 

Now, the RHS of the previous display converges to zero because < \/rn and K{ht{Y^"^^))^ < 
00. This proves condition (i) of Lindeberg's theorem. 

Let us now turn to condition (ii). For t,s £ [0, 1], we obviously obtain 

m 2 

^E(X„,,(t)X„,,(5) I Tm,^-l) =^^E(/li(y/"^))/l,(y/'")) I Tm,^-l) 

i=l 1=1 

2 m 

- % VE(/ii(y/™)) I j-„,,_i)E(/i,(y/™)) I j-„,,_i). (49) 

i=l 

Next, by using ab < 2(a^ + b'^) for all a, 6 G M together with (67), the second term in the RHS 
of (49) tends to zero in probability. Moreover, we have 

i=l ^ 

4 m 
777. 

t=l 

because the elements inside the sum are martingale increments. Hence, the quantity inside 
the above display tends to zero. Combining the latter with (49) establishes condition (ii) of 
Lindeberg's theorem provided that the following holds: 

2 ™ 

Y: hiYl-^)hAY}"'^) A (1 + \e\)-'Kit, s). 

1=1 

This comes directly from the law of large number stated in Lemma C.3, because rf^/m — )• 
{l + \e\r^ by (3) and (ifs). 
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Applying Lindeberg's theorem (in the underlying space described above), for any ti, . . . ,tk G 
[0, 1], the random vector 

Zm = {Mm,m.{tl), • ■ • , Mm,m{tk)) 

converges stably in the following sense (see, e.g., Jacod and Shiryaev (2003) Definition 5.28): 
for all (fixed) bounded random variable U and continuous bounded function / in M'^, 

E{Uf{Zm)) ^ E(C/)E(/(Z)) as m ^ oo, 

where Z is a centered multivariate Gaussian vector with covariance (l+|^?|)^^(i^(tj, tj))i<jj<fc. 
This implies that {Zm, Yi) converges (jointly) in distribution to C{Z) (g) AA(0, 1). This finishes 
the proof of Proposition 5.2 in the case where (eigenvalues-awayO) is assumed to hold. 

Case 2: (eigenvalues-awayO) is not assumed The strategy is to apply Lemma C.4 in order 
to reduce the study to "Case 1" above. For any e > 0, let 



(1 + £2)1/2' 



where ^i, ^2; • • • are i.i.d. A/'(0, 1) variables, independent of all the Y^'s. The covariance matrix 
of (y/, . . . , Y^) is obviously 



1 + e' 



Clearly, the corresponding rate (3) is 



[m 



^ + {1 + e"^) ^ |7-m|) It is related to via 



the following inequalities: < < (1 + e^Y^'^Vm- Hence, satisfies (vanish-secondordcr) 
and {H'l) with 6 replaced by 6'^ = j^O. Since it also satisfies (cigcnvalues-awayO) , by using 
Proposition 5.2 in the "Case 1" above, it satisfies for any ti, . . . ,4^ G [0, 1], 



(a) (r^(F^(ti)-ti),...,r^(F^^(tfc)-tfc),y/; ^ ^ 
where F^(f) — t = m^^ Z^ilLi ht{Yi) for all t. Next, we clearly have 



)AA(0,1) 



Let us now prove that for any t G [0, 1], 

limsupjE rm(Jm{t) - t) - r^(F^(t) - t) | ^ as e ^ 0. 
This will conclude the proof by applying Lemma C.4. First, we write 



(50) 



E 

< E 

< < 



r„(F„(t)-t)-r^(F^^(t)-t) 



rmlmY,{ht{Yi)-ht{Yl)) 

i=l 

/ m 

{rJmf¥.[Y,{hm-ht{Yl)) 



1/2 



i=l 

+ ((1 + £2)1/2 r^(F^(t)-t) 
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By taking the limsup in the above display, it only remains to show 



liinsup I {rm/mf E [^2^^^) " ht{Yf)) j | ^ as e ^ 0. (51) 
This can be proved by using Lemma B.3 (80) as follows: 

(m \ 2 

m 

= {rmlmf ^ E ((/it(y,) - ht{Y!))(ht{Yi) - ht{Y-))) 

m 

= (Tmlmf (E (ht{Y.^ht{Y.j)) - E {ht{Y,)ht{Yf)) - E {ht{Yl)ht{Y,)) + E (/it(y/)/it(y/))) 



(r„/m)2 E E ^(r..)^ (l + (1 + e'Y' " 2(1 + e^Y'!' 



i,j=l i>2 



because Cov(yi,yj) = Tij, CoY{Yf,Yj) = Cov{Yi,Yf) = Tij/{l + eY^'^ and Cov(y/,y/) = 
Tij/ (1 + e^). Next, by separating the case i = j and i ^ j, the previous display can be upper 
bounded by 

E ^ |i + (1 + '"r' - + + E(rM)^ X 4 Mil!. 

£>2 ■ i^j £>2 

While the first term above does not depend on m and converges to zero as e — )■ 0, the second 
term above as a limsup^ equal to zero by (vanish-secondorder) . This implies (51) and finishes 
the proof. □ 

5. 3. Convergence of finite dimensional laws for ¥m 

In this section, we aim at proving the following result: 

Proposition 5.3. Consider the assumptions of Proposition 5.2. Then, (12) holds in the sense 
of the finite dimensional convergence. 

Proof. From expression (44), we investigate the (joint) convergence of (r.m(Fm — I),rmYm)- 

Case 1: 6 = —1 In that case, Var(ym) — )• 0. Hence, we can directly use Proposition 5.2 
to state that {rm{^m — I), fmYm) converges to /2(Z/ (1 + |^|)^'^^) ® 5q in the sense of the finite 
dimensional convergence. This establishes Proposition 5.3 in that case. 

Case 2: 6 > —1 Now, r^Var(ym) is converging to some positive real number, namely 
{1 + 9)/{l + \9\) > 0. In particular, Var(ym) > for m large enough. Let us define the 
random variable 

yo =F™(Vary™)-i/2. 
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We now consider the (m + l)-dimensional random vector (li)o<j<m) which is centered, with 

a covariance matrix denoted A^*""*"-^) = {^i^~^^^)o<i,j<m, and such that Ag™"*"^^ = 1, A-'^"''^^ = 

r^'J^ for 1 < i,j < m. We easily check that A^"^^^^ satisfies (vanish-secondorder) and 
{H2) with the same value of 9 and a rate asymptotically equivalent to the original r^, see 
Lemma A. 2. Hence, Proposition 5.2 shows that (by using notation therein), 

(^rm (^{m + ir^f^htiYi)^ ,Yo^ ^ £(Z/(1 + |0|)i/2) ^ AA(0, 1), 

in the sense of the finite dimensional convergence. Since rmht{YQ)/m tends to zero in proba- 
bility, the last display can be rewritten as 

^r„(F™ - /),F„(Vary„)-i/2^ ^ £(Z/(1 + \e\)^/^) ® AA(0, 1). 
Finally, since r'^Yai{Ym) — ^ (1 + ^)/(l + 1^*1)) we finish the proof by applying (44). □ 
5.4- Tightness 

To complete the proof of Proposition 5.1, we now prove that the process Xm = rm{¥m — 
I) is tight in the Skorokhod space, under the assumption of Theorem 3.1. This will also 
imply tightness for rm(F,m — I) by (44) because ci is a continuous function on [0, 1]. Hence 
Theorem 3.1 will be proved. 

For proving the tightness of Xm, we use Proposition C.l. This is possible because |ci(t) — 
ci(s)| < L\t — s|^/^, < s,t < 1 for some constant L > 1 (see Lemma B.4). Below, we prove 
that (82) holds in the following way: for large m, 

E\Xm{t)-Xmis)\''<C{\t-sf/^ + {rm)-''>\t-s\), forant,sG [0,1], (52) 

for some constant C > and for a constant Eq > such that {Hi) holds. 
To establish (52), fix t, s G [0, 1], s < t and write 

E\Xm{t) - Xm{s)\^ = ^Y. Hh(Y^)h(Yj)h(Yk)h{Ye)), (53) 

where we let h{x) = 1 {s < $(x) < t} — {t — s) — (ci(t) — ci{s))x = ht{x) — hs{x). Now, we 
split the sum in the RHS of (53) following the value of the cardinal of k,£}. 

Sum over j, k,i} = 1 The corresponding summation is ^ Yl^Li ^ {(^0^))^) ■ We have 

E {(h{Yi) f) < 3^ (\t -s\ + \t- + E(|yi|^)L^|t - sj^/^^ < Ci\t - s\, , (54) 

for Ci = 5 3^L^ > 0. Since < m, we obtain 

4 m „ 

"^.E^mYm<^\t-s\. (55) 

i=l 
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Sum over ^{i,j,k,i} = 2 Up to a multiplicative constant, we should consider the sum 



r, 



where, for an arbitrary r/i > 0, T^^^ and Tg^'' are defined by 

4 

.(1) _ ^ 



^1 = 3 E 1 {I^mI > '?i}IE {{h{Y^f{h{Y,)f) ; (56) 



m 



.(1) _ '^m 



5] 1 {|r,,,| < ,?i}E {ihiY^)\hiY,)f) . (57) 



On the one hand, by using (54), 



On the other hand, by using (76) in Proposition B.l (with 5i = 52 = (/i)^ and d = 2), we 
obtain that for any i ^ j such that |rjj| < rji (choosing r?i > such that 2-y/3?7i < 1), 

for C2 = 3^L^ (E (|Z|8/3))^/^ G (0,00) . Hence, we get 

T«<- ^-|t-.|3/2. (59) 

2 -(1-2^3771)2' ' ^ ' 



Sum over #{i,j,k,£} = 3 Up to a multiplicative constant, we should consider the sum 

m 



^ E{h{Yi)h{Y,){h{Y,)f) =tI''> 



where, for an arbitrary r/2 > 0, T^'^^ and Tg^^ are defined similarly to (56) and (57), by 
separating the case where maxe^-^e2e{j,i,fc} 1^61,62! is above or below 772- 
On the one hand, by using (54), we have 



4 



^(2) ^ Cir^m 



V 1<^ max jrei.eal > Mi - s| 



o4 / ^4 



On the other hand, by using (77) in Proposition B.l (with gi = 52 = 53 = (/i)^, /i = 
/2 = l{s < <!>(•) < t}, d = 3 and d' = 2), we obtain that for any distinct /c such that 
maxe^-^e2e{j,j,fc} 1^61,62! ^ ^2 (choosing 772 > such that 3-^/3772 < 1)) 

'^-s|3/2x^/C2. 



E(/i(y,)/i(y,)(Myfc))2) < max |re„e2r--^ 



^3 1 
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This yields 

(1 - 3V3??2)^ I m-^ 



Sum over ^{i,j,k,i} = 4 The last sum to be considered is 



r. 



IE {hiY,)hiY,)hiYk)hiYe)) = + Tf , 

where, for an arbitrary r/3 > 0, T-|'^'' and Tg^^ are defined similarly to (56) and (57), by 
separating the case where maXe^-^e26{ij,fc,^} 1^61,62! is above or below 7^3. As before, 

ri.(3)^Cirt y il max \Te,e2\>V3]\t-s\ 

Next, by using (77) in Proposition B.l (with gi = h, fi = 1 {s < <!>(•) < t} and d' = d = 4), 
we obtain that (choosing 773 > such that 4-^3ry3 < 1), 



" m4 . ^ ei^e2G{M,M}' " (1-4^3%)^ 



s«7T^4^^Eir..rii«-i'- (63) 



;i - 4\/3^)4 I m2 

Finally, we obtain (52) by combining the bounds (55), (58), (59), (60), (61), (62), (63) and by 
using the assumptions (vanish-secondordcr) and {Hi). 

5.5. Proof for Remark 3.3 

—1/2 

Obviously, (-f^s) and {Hi) imply (vanish-secondordcr), (-^2) with 9 = +00, and ~ 7m • 
Hence, Proposition 5.2 entails that the finite dimensional laws of = fmi^m — I) converge 
to and it only remains to prove that X^n is tight. This can be done as in the previous section, 
except that we use k = 2 in Proposition C.l. Namely, we prove that, for large m, 

¥.\Xm{t)-X^{s)\^ <C-f^^\t-sl forant,sG [0,1], (64) 

for some constants C > 0, (5o > 0. To prove (64), we write (by using the same notation as in 
the previous section) 

2 

^Xra{t) - X,n{s)\^ =!^J2Hh{^MYj)) 

<c/^\t -s\ + c^\t - V |r,,,f 



+ ^Y.^{\Tij\<r,}E{h{Y,)h{Y,)) 



m 
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for some r/ > and by letting C3 = 4 3^L^ > 0. Applying now (77) in Proposition B.l (with 
gi = h, fi = l{s < <&(•) < t} for i = 1,2 and d' = d = 2), we obtain that (choosing r] > 
such that 2^/3r] < 1), 



m 



J2i{\r,,,\<rj}E{hiYi)hiY,)) 



< 



(12)^ 
(1 - 2^)2 



m 



2 



2 

Finally, since (i^s) and (Hi) provide ^ Sj^j = 0(7,*^) and r'^/m = 0(7^) for some 

£1,52 > 0, the criterion (64) is proved with 60 = ei A £2 and the proof is finished. 



Appendix A: Technical results for proving the main theorem 

Lemma A. 1. Assume thatV^^^ satisfies (vanish-secondorder) and (eigenvalues-awayO). For 
1 < i < m, let us consider the filtration {J-i}o<i<m, defined by J-q = cr{9) and Ti = 
(t(Yi, . . . ,Yi), and denote af = Var [E {Yi \ J^i-i)]. Consider the function ht{-) defined by (46), 
the Hermite polynomials Hi{-) defined by (71) and the coordinates q(-) defined by (5). Then 
the following holds: 



' n 

m 



' n 

m 

E 



1=1 

^ (E [E (y, I Ti-i) E {Yj I -F,-i)])' ^ 0; 

id 
m 

^E [(E {ht{Yi) I -Fi-i))'] ^ 0, for any t e [0, 1]; 



1=1 

( ^JUL 
\ m 



i=l 



0, for any t G [0, 1]. 



(65) 
(66) 

(67) 
(68) 



Proof. By using Cholesky's decomposition, we can write T = RR^ where R is m x m a lower 
triangular matrix. Hence, denoting by -Ri^,, . . . Rm,. the lines of R, we have < Ri^,, Rj^, >= Tij 
for all i,j. Moreover, since we can write Yi = J2]=i fo^ some Zi, . . . , i.i.d. A/'(0, 1), 

we have i?^ . = Var(yi \ Ti-i) = 1 - af and af = ^Ij for ah i. 

Let us now prove (65). From (cigcnvalucs-awayO), we have for all x G M™", HiJ-'^xlp = 
x^Tx > ry||x|p. Hence for all x G M"*, > r/ljxlp. Thus, we have 

m J— 1 m j—1 m 

= EE ([^^]^)' > ^EE4. = vE-l 

i<j j=l i=l j=l i=l i=l 

which proves (65) by (vanish-secondorder). As for (66), we have for i < j, 



E[E(y, I Ti-i)E{Yj I Tj-i)]=E 



i-l 



i-1 

E ^i,kZk E ^j/^t 
.k=l £=1 



i-l 



k=l 
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(E [E (y, I E (y, I = YiVi,, - R^,iR,,f < 2 r2, + ^ fl^ . , 



which estabhshes (66) by (65) and (vanish-secondorder) . 

Next, let us estabUsh the fohowing equahty in Lp'l^rn)'- for any i = 1, . . . , m and t € [0, 1], 



E(/^,(y,)l-^.-i) = E^-'^^ 



i>2 



^{Yi I J-,_i) 



(7,; 



(69) 



where the RHS of (69) is if cxj = 0. For this, consider some 1 < i < m and assume cxi > 
(otherwise the result is obvious). Let Yi = ^(^' ~ -^(0, 1). By using the multivariate 

Gaussian structure of Y , the distribution of Yi conditionally on Fi-i only depends on Yi. 
Hence, we can write E {ht{Yi) \ Fi-i) = g{Yi) for a (unique) function g in ^^(M, A/'(0, 1)). We 
now consider the expansion of g w.r.t. the Hermite polynomials in that space: 



g 



n9{Yi)H,{Yi)) 



and we can compute each coordinate K{g{Yi)H£{Yi)) in the following way: for any £ >0, 
E 



H,{Yi)E{ht{Y,) I \ = E [HeiY,)ht{Yi) 

'■fit] 
(f)! 



l'>2 

ce{t) 



aUll{i>2}, 



by using Fubini's theorem (because ^^,>2 ^t^^ \Hi{Y)He'iYi)\ < (^!)^/^E^' 

do), and by applying (72) with Cov(yi,y) = ai. This proves (69). 
Finally, by using (69), (72) and notation above, we have 



\ce'(t)\ 
'>2 (i'\)i/2 



< 



E 



m 



i=l 



r 

' n 

m 



EE[E(/it(y) I Ti-i)E{ht{Yj) I Tj^i)] 



r, 



2 

m 

ij £>2 £'>2 ^ 



EEE 



Q(t)Q/(t) ^ 



H,{Yi)He{Yj) 



r. 
m 



2 ^ ^ Q(t)^ 



< 



E 

£>2 



ij £>2 



afa'^ (e 



E(E[E(y, I j-,_i)E(y, I j-,_i)])^ 



which proves (68) by using (66). Exactly the same calculation with "i = j" shows (67) from 
(65). □ 
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Lemma A. 2. Assume that r^™) satisfies (vanish-secondorder) and that r^\ai{Ym) con- 
verges to some positive real number. Consider the (m + 1) x (m + 1) covariance matrix A^'^'^^^ 
of iYi)o<i<m defined in Section 5.3. Then the rate 



rm+i 



j-^(m+l) 



(m + l)-2 ^ A, 



(m+l) 



-1/2 



satisfies rm+ii-A^™'^^^) ~ rm and moreover 



.{m+1) 



o(l). 



(70) 



In particular, A^*""*"^) satisfies (vanish-secondorder). Finally, when {H-j) holds for r*^™), it 
also holds for K^"^~^^\ with the same value of 6. 

Proof. By definition, 



Since Aqj = (Var y^) ^^'^m ^ Y^^i we have 



Ao,i = (Vary„)-i/2jn-im-2 ^ Tij 



1/2 



l<i,jr<m 



which is o{l/m) because F satisfies (vanish-secondorder) and thus (A). This imphes rm+i(A) 
rm- Next, we estabhsh (70). Let us write 

[m+l)-\l Y i^^,jf = im + lr'rl^ Y (r^f + 2 J] (Ao,,)^ 

Furthermore, we have 

Y (Ao,,)' = (VarF„)-i Y ("^"'Et,,) 



< (Vary„)-im-2 ^ 2ri,i, + ^ FijF,/j 

l<i,i'<rri \ 

<2+(mVarF„)-i (F^^f . 

i<*7^i<»Ti 

This imphes the result, because mYarYm > ^'m^^^^m; which is bounded away from by 
assumption. □ 
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Appendix B: Results related to Hermite polynomials 

Let us first recall that the sequence of Hermite polynomials Hi{x), i > 0, x € M, is defined 
by the expression: for all i > 0, 



Vx G M, 4>^''>{x) = {-lYHe{x)(f){x), (71) 



where 4>{x) = (27r)~"'^/^ exp(— x^/2) is the density of a Gaussian standard variable and 
denotes its ^-th derivative (by convention, (f)^^^ = (p). For instance, we have Ho{x) = 1, 
Hi{x) = X and H2{x) = — 1. 

A weU known fact is that {Hi{-) / {11)^/'^ J > 0} is an Hilbert basis in L^{R,M{0, 1)), the 
Hilbert space composed by square integrable functions w.r.t. the standard Gaussian measure. 
Moreover, the following property holds: for any centered 2-dimensional Gaussian vector (U, V) 
with = = 1^ 

y£,e' >0,£^ £', E{He{U)He'{V)) = {Coy{U,V)Y £1 6e,e>. (72) 

The latter can be seen as a consequence of Mehler's formula, itself being nicely presented in 
Foata (1981) (1.4) (see also references therein). 

Proof of Proposition 2.1 Let us start by expanding, for any t G [0,1], the function 
1 {^i') < t} w.r.t. the Hermite polynomial basis in L'^{R,Af{0, 1)): 

i{H-)<t} = Y,cdm{-)/i£\). (73) 

By applying (73) at Yi, we obtain the following expansion in L^(Pm): for all i = 1, . . . , m, 

i{$(y,) < t} = Y,c,mm/{£^. (74) 

By averaging w.r.t. i, we obtain 

Q(t) 
£\ 

t>l i=l 



where the series in the RHS of (75) converges in L^(Pm) (by using the triangle inequality). 
The proof is finished by combining (75) with (72). 

Next, the following proposition shares some similarities with Lemma 4.5 of Taqqu (1977) 
and Lemma 3 of Csorgo and Mielniczuk (1996). 

Proposition B.l. Consider an integer d > 2, a positive number p such that \fipd < 1 
and Z ~ ^"(0,1). Let gi,..-,gd be d measurable real functions defined on M such that 
E (|g(j(Z)|^/'^) < +00, 1 < i < d. Let {Ui, . . . , Ud) be d-dimensional centered Gaussian vector 
with EC/j? = 1, 1 < i < c^, and |E(C/jC/j)| < p, I < i j < d. Then the following holds: 



E 



.i=l 
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Furthermore, ifK{gi{Z)) = and E,{Zgi{Z)) = for 1 < i < d' for an integer d' , 1 < d' < d, 
we have 



E 



.1=1 



(1 - ^pdY 



3/4 



(77) 



where fi is any function such that fi{x) = gi{x) — ai — fiiX, x G M, Oj, j3i G M, for 1 < i < d' 
and fi = Qi otherwise. 

Proof. The Kibble-Slepian formula Kibble (1945); Slepian (1972) (given, e.g., in expression 
(2.2) of Foata, 1981) provides that 



E 



\{9^{Ui, 



.4 = 1 



.^g,{Z)H,^{Z) 



u i<j 



i=l 



En 

u i<j 



{E{UiUj)y 



,llEig,iZ)H,^XZ)), 



(78) 



i=l 



where the summation is over all the dxd symmetric matrix u = {vij)i<ij<d with nonnegative 
integral entries and with diagonal entries equal to zero, while fj. denotes vn + ■ ■ ■ + Vid- 
Above, we have implicitly used Fubini's theorem (the summation over v is infinite). The next 
calculations show that this is indeed valid: by using the assumptions, we have 



u i<j 



< 



i=l 
1/2 



E\g,{Z)H,^XZ)\ 

\ 1/2 



s E n 

xi,...,xaeN<i 



1 vrii^i. 



E\giiZ)H,jZ)\ 



n 

i=l 
d 

n 



E 



P 



yi-\ — \-yd 



1/2 



^9^{z)H, 



J/lH hj/d 



1/2 



S/lH l-yd=^ 



(79) 



Now, in the latter display, the sum over y is upper bounded by d^, which gives that the RHS 
of (79) is upper bounded by 



Y^ipd^-'^W. g,{Z)He{Z)/{i\)y' < {Yi^pdY'](^(\m{zr^' 



.e>o 



3/4 



where the latter combines Holder's inequality with Lemma B.2 (used with p = 4). This proves 
(76) and shows that Fubini's theorem can be applied to get (78). 
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Finally, we prove (77) by using (78) and the same calculations as above, except that the 
absolute values should be kept outside the expectations. As a result, for 1 < i < d', since 
K{gi{Z)Hi{Z)) = for ^ = 0, 1 by assumption, the corresponding sums over £ start at £ = 2. 
This establishes (77), because for ah £ > 2 and 1 < i < d', E{gi{Z)H(,{Z)) = E{fi{Z)Hi{Z)). 

□ 

The following result was obtained in the proof of Lemma 3.1 in Taqqu (1977). We provide 
an elementary proof below. Also, let us mention that there are more accurate such results 
when i grows to infinity, see Theorem 2.1 in Larsson-Cohn (2002). 

E(^He{Z)/VI^Yf^'' <{p-lY/^ 



Lemma B.2. For all even integer p > 2 andi > 0, we have 
/or Z~AA(0,1). 

Proof. For some ^ > 1, by using H'^ = iHi^i and (71), we obtain 



= e{p -I) j [H,{x)f-\H,^i{x)f<i){x) dx. 

Next, by using Holder's inequality, we get {J[Hi{x)f(t){x)dxf'^ < i{p-l) (/ \H^^i{x)\P(t){x)dx) 
and the result is obtained by induction on □ 

Lemma B.3. Consider the function ht{-) defined by (46) and q(-) defined by (5). Let us 
consider a two-dimensional centered Gaussian vector {U,V) with EC/^ = KV'^ = 1. Then for 
any t, s E [0, 1], the following holds: 

E{h,{U)K{V)) = Y,'-^^^^{Cov{U,V)Y. (80) 

e>2 

Proof. Expression (80) is a direct consequence of (72) and of Fubini's theorem. □ 

Lemma B.4. The function ci(-) = satisfies the following: for all v G (0, 1), there 

exists some constant Cu > such that for all s,t £ [0, 1], 

\ci{t) - ci{s)\ < C,\t - s\^-r (81) 

Proof. First note that the derivative of ci on (0, 1) is <I>~^. Classically (see, e.g.. Lemma 12.3 
of Abramovich ct al. (2006)), there is some xq £ (0,1/2) such that for any u £ (0,Xo), 
$~^(n) < y^2 log(l/n). Also, obviously, for some fixed u > 0, there is some > such 
that for any u G (0, xq), -y/21og(l/ii) < Clu~'^. As a consequence, since \^~^\ is bounded on 
[xq, 1 — xq], there exists some constant C" > such that for all u G (0, 1), |$~"'^(n)| < C'lu"^ . 
This entails that for all < s < t < 1, 

|ci(t) - ci(s)| < r \^'^(u)\du < -^{t^-'' - s^"") < C^{t - sY~'' 

by letting Cv = C"/(l — v) > Q and because (x + yY < x^ + y^ for any x,y > and any 
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Appendix C: Useful auxiliary results 

The following result can certainly be considered as well known, although we failed to find a 
precise reference for it. It can be seen as a reformulation in our framework of classical tightness 
results as given, e.g., in Lemma 2 of Csorgo and Mielniczuk (1996), in Remark 2.1 of Shao 
and Yu (1996) and Proposition 6 of Dedecker and Prieur (2007). 

Proposition C.l (Tightness criterion for empirical distribution function with non-stan- 
dard scaling parameters). Consider (,i,...,^m real random variables (that need not to be 

— P — 
independent or identically distributed) such that — > Q as m tends to infinity, for = 

X^ilLi im, and consider the process 

m 

Zm{t) = iam/m)Y,9t{^^), for t G [0, 1], 

i=l 

where (am)m is some positive sequence tending to infinity as m tends to infinity and where 
gt{x) = l{^{x) <t}- fo{t) - fi{t)x for functions /o,/i on [0, 1] such that \ fo{t) - fo{s)\ V 
\fi{t) — fi{s)\ < L\t — s\'^, < s,t < 1, for some q £ (0,1] and L > 0. Assume that the 
following holds: for large m, 

E|Z„(t) - Z„(s)|" < C{\t - s\^^ + {am)-^^'''\t - s^'), for all t, s G [0, 1], (82) 

for constants k > 0, C > 0, 5i > 1, g' G (0, 1] and 62 > 1 — q' . Then, as m grows to infinity, 
the sequence of processes i^Z^f^^^ is tight in D(Q, 1) (endowed with the Skorokhod topology and 
the corresponding Borel a -field) and any limit is a.s. a continuous process. 

Proof. The proof is based on standard arguments and is similar to the proof of Theorem 22.1 
in Billingsley (1968). Fix e G (0, 1) and 77 > 0. Following Theorem 15.5 in Billingsley (1968), 
it is sufficient to prove that there exists a 5 G (0, 1) such that for large m, 

/ \ 

sup \Zjn{t) - Zm{s)\ > e <r]. 
0<s,t<l 

\\s-t\<5 J 

We merely check (see, e.g., the proof of Theorem 8.3 in Billingsley (1968)) that the latter 
holds if there exists 6 G (0, 1) such that for large m, 

Vs G [0, 1], P I sup |Z„(t) - Z„(s)| >e] < r]6. (83) 

\t:s<t<{s+5) Al J 

Let us now prove (83). Fix s G [0,1]. Assumption (82) entails that for all u,v £ [0,1] such 
that (f — u)'^ > e/am, we have 

E\Zm{v)-ZMr ^ J^l^-^l^' 

for 63 = 5i A {q' + 82) > 1. Hence, if p > is such that p'^ > e/om, applying Theorem 12.2 of 
Billingsley (1968) we have for all integer M such that s + Mp < 1 and for all A > 0, 

max^ \Z^is + ^p) - Z^l > a) < J^.iMp)'' (84) 
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for some positive constant K > (only depending on ^3, k and C). Next, we use the following 
inequality: for all < u,v < 1, u < v < u + p, 

\Zm{v) - Zm{u)\ < \Zmiu + p)- Zm{u)\+2La„,p'iil + \lJ). (85) 

The latter holds because we have 

m 

Zm{v) - Zm{u) = {am/m)^l{u <ii<v}- am{fo{v) - fo{u)) - am{h{v) - fi{u))im 

i=l 

< Zm{u + p)- Zm{u) + 2La^p''{l + \U) 

and Zm{u) - Zm{v) < am{fo{v) - fo{u)) + am{fi{v) - fi{u))Cm < LamP'^il + l^^l). 
Now, by using (85), we obtain 

sup \Zm{t) - Zmis)\ < 3 max \Zmis + ip) - Zmis)\ +2Lamp''{l + (86) 

t:s<t<s+Mp l<i<M 

Furthermore, provided that amP"^ < 2e, we have P(2LamP^(l + \lm\) > ^L^) < IP(l?ml > 1/4)- 
Hence, combining (84) and (86), by taking 5 G (0, 1) such that V^"^^^'''^ < ^/2, we will 

obtain that for all s £ [0, 1 — 5], for large m, 

P ( sup \Zm{t) - Zm{s)\ > (3 + 5L)e] < -^^5'^ +F{\U > 1/4) < 

\t:s<t<s+S J e«+'^2/9 

as soon as we can choose p > and an integer M such that Mp = 5 and e/am < < 2e/am- 
This holds if there exists an integer into the interval [(5(am/e)^'''^, 5(«m/ (2e))^^'^], which is true 
for large m because am tends to infinity. This entails (83) with e replaced by (3 + 5L)e and 
the proof is finished. □ 

Proposition C.2 (Partial functional delta method on D(0,1)). Consider the linear space 
D{0, 1) of cad-lag function on [0, 1] and the linear space C(0, 1) of continuous functions on 
[0,1]. Let e = {60, 9i) e -D(0, 1)2. Let cj) : D{0,iy ^ R be Hadamard differentiable at 9 
tangentially to C(0, 1), w.r.t. the supremum norm, and such that the derivative is of the form 

MHo, Hi) = ge{Ho), for any {Ho, Hi) e C{0, 1)^, 

for a continuous linear mapping gg : C(0, 1) 1— )• M. Consider Zo^m; ^i,»n; m > 1, processes 
valued in D (0,1) andZo, Zi two processes valued a. s. mC(0, 1). Assume that the two following 
distribution convergences hold (w.r.t. the Skorokhod topology and the corresponding Borel a- 
field), for some positive sequence {am)m tending to infinity: 

am{'^0,m — ^0) 
am(^l,m — ^'l) ^1- 

Then we have 

^i,m) - m) ^ 9e{^o). (87) 
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Proof. Classically, let us show that for any subsequence {n} there exists a further subsequence 
{£} such that (87) holds along this subsequence. For any {n}, since both processes a„(Zo,n — 
^o) and an(^i,n — di) are (Skorokhod-)tight, the joint process (a„(Zo,n — ^o)7 0n(^i,n — ^i)) 
also is. Hence, by Prohorov's theorem, there exists a further subsequence {£} such that 
(a£(Zo/ — 9o), ae{7^i/ — 9i)) converges in distribution. Now applying the Skorokhod's repre- 
sentation theorem (see, e.g.. Theorem 6.7 page 70 in Billingsley (1999)), there exists random 
elements Ti = (To_£,Ti^£), £ > 1, T = {Tq,Ti), defined on a common probability space, such 
that C{Ti) = C {aiiZo^i - 9q), a^(Zi^^ - 6*1)), -C(ro) = C{Zo), C{Ti) = £(Zi) and Ti converges 
a.s. to T. Since both Tq and Ti belong to C7(0, 1) (a.s.) and since any sequence of cad-lag func- 
tions converging (w.r.t. to the Skorokhod distance) to a continuous function also converges 
uniformly, we obtain 

ll^b/ - Tbiloo + - Tilloo a.s. 
Hence, the Hadamard differentiability of (j) entails: 

+ m) - m 



?(To) a.s. 



for any sequence — )■ 0. By taking tn = l/a£, we derive (87) along the subsequence {^}, which 
proves the result. □ 

Lemma C.3. Assume that T satisfies (A). Then for any /i : M — t- M measurable such that 
K\h{Z)\ < 00, we have 

m 

m"^ J]]/i(yi) Ae[/i(Z)], /or Z~AA(0,1). (88) 

i=l 

Proof By Section 2, Assumption (A) implies that Vt G [0, 1], Fm(t) A t. Since h £ L^{R,M{0, 1)), 
for any e > 0, there is a continuous bounded function h^ such that E|/i(Z) — hf,{Z)\ < e. 
Moreover, by definition of the weak convergence, (88) holds for h = hf, (for instance, the 
convergence in probability can be seen as an a.s. convergence up to consider subsequence). 
Since we have 



sup < E 

m>l 



'Y.{h{Y,)-K{Y,)) 



m 

i=l 



< snp {m-'Y^E \h(Y,) - he{Yi)\ } <e, 
rn>i[ U J 

we can conclude by using Lemma C.4. □ 

The following lemma is classical, see, e.g.. Theorem 4.2 in Billingsley (1968). 

Lemma C.4. For n > 1 and e > 0, let X^, X„, A"^, X be real random variables (Xn and 
X^ being defined on the same probability space) and such that 

(a) \/e > 0, X^ as n ^ 00; 

(b) X^ X as e 0; 

(c) limsup„_j.o^{E|A^ — A„|} — )• as e — 0. 
Then A„ X . 
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